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Abstract 

It is shown that geometric optical description of electromagnetic wave with ac- 
count of its polarization in curved space-time can be obtained straightforwardly from 
the classical variational principle for electromagnetic field. For this end the entire 
functional space of electromagnetic fields must be reduced to its subspace of locally 
plane monochromatic waves. We have formulated the constraints under which the 
entire functional space of electromagnetic fields reduces to its subspace of locally 
plane monochromatic waves. These constraints introduce variables of another kind 
which specify a field of local frames associated to the wave and contain some congru- 
ence of null-curves. The Lagrangian for constrained electromagnetic field contains 
variables of two kinds, namely, a congruence of null-curves and the field itself. This 
yields two kinds of Euler-Lagrange equations. Equations of first kind are trivial due 
to the constraints imposed. Variation of the curves yields the Papapetrou equations 
for a classical massless particle with helicity 1. 

1 Introduction 

Quantum mechanics provides exhaustive description of motion of a particle in limited 
scales, when typical length of run is comparable with the wavelength, normally, in atomic 
and sub-atomic ones. When considering motion of a particle in scales which are apparently 
non-comparable with typical wavelength, that exhaustive quantum mechanical picture 
becomes less convenient, and to use it, one passes to asymptotical behavior of incident 
and scattered waves. In astrophysical scales, particularly, when studying deflection of light 
in gravitational field, classical mechanics is evidently more convenient and one prefers to 



consider photon as a classical massless particle drawing a null-geodesic in the space-time. 
This approach to light propagation in curved space-time is quite satisfactory while photon 
is considered as a scalar particle. If, however, its polarization becomes important the 
question arises, how to include it into the classical mechanical description. 

If the wavelength is big enough, i.e., photon momentum is relatively small, the 
commonplace classical mechanical description becomes incomplete. The point is that 
under some conditions spin of the particle becomes comparable with some components of 
its orbital momentum, which usually are regarded as zero. This may happen, for example, 
when describing a light beam incident to a Schwartzschild black hole. Due to the classical 
mechanical considerations each photon of the beam has zero longitudinal component of 
orbital momentum. However, its spin is also longitudinal and, hence, contributes the 
total longitudinal component of angular momentum. Since, on one hand, spin is always 
collinear to momentum of photon, and, on the other hand, gravitational deflection of the 
beam changes the momentum, this description contradicts the conservation law of total 
angular momentum. It is clear that in order to have the correct picture one should take 
spin of photon into account such a way that the total angular momentum conserves. If 
it is done the sum of longitudinal components of orbital and inner momenta is constant, 
thus, the earlier is not zero after deflecting. Therefore, if photon momentum is small 
enough, this effect can change the shape of scattered beam. Thus, there exist situations 
when the well-known corpuscular theory does not work. 

While in case of spinless particle one can make a choice between the usual (corpuscu- 
lar) and wave mechanics due to scales under consideration, in case of particle with helicity 
the earlier does not work, so, the only possibility is to use the latter. The latter provides 
description in terms of special functions, say, Legendre polynomials, which are convenient 
while their powers are not very high. However, in some real situations powers of the 
polynomials are of the order of astronomical distances measured in wavelength units. In 
these situations corpuscular description in which spin of the particle is taken into account 
properly, would be much more convenient. The goal of the present work is construct a 
model of photon with given momentum and helicity ±1 in a curved space-time. 

2 Statement of the problem 

The desired model is assumed to provide certain world line of a massless particle 
which has helicity 1 and, at the same time, description of electromagnetic field which 
everywhere draws a locally plane wave. The main difficulty is that spin of the particle is 
quantized, thus, the desired construction must contain both classical and quantum degrees 
of freedom. The notion of classical particle with quantum spin seems to be one of the 
simplest systems of mixed classical-quantum nature, and attempts to built correct theory 
of this object are lasting for decades [I]. Our task is somewhat wider, because we try not 
only to obtain equation of the particle world line, but also a locally plane wave, in other 
words, to describe propagation of circularly polarized photon in terms of both geometric 
and wave optics. 

An attempt to build such a model was made in the works (21 E] due to the problem of 
light propagation in Schwartzschild space-time. Since electromagnetic field presents in the 



construction a new question arises, how to combine the field in the space-time and helicity 
which must be attached to the unknown world line. In the work j2] electromagnetic field 
was removed by a vector field which was defined on the world lines. This substitution 
made it possible to combine wave and world line under assumption that if the lines are 
found properly then the fields attached to them constitute the entire electromagnetic field 
in the space-time. 

In the present work we revise this approach. Instead of specifying a functional space 
of curves and attaching a vector field to each curve, we assume that the same result could 
be obtained from the pure electromagnetic Lagrangian. The main idea of this work is that, 
after all, both geometric and wave optics should follow from pure electromagnetic theory, 
therefore, the model should be built in the framework of the theory of electromagnetic 
field. To do it, we start with the well-known form of Lagrangian of electromagnetic 
field and restrict the functional space of the field variables with its subspace of the fields 
behaving as locally plane waves. Since restrictions of this sort are known as constraints 
we assume that putting relevant constraints on the Lagrangian leads to special Euler- 
Lagrange equations for the waves in question, and the desired description containing both 
geometric and wave optics follows from it. 

3 Locally plane wave and associated orthonormal frame 

The notion of plane wave in space-time differs from that in space where the wave vector is 
orthogonal to the hyperplanes, cannot be tangent to them. In space-time the wave vector 
is orthogonal to the hyperplanes and, at the same time tangent to them. To see this 
consider flat space-time and Cartesian coordinates {t,x, y,z} in it which are chosen such 
a way that the wave has phase = u(t — z). The phase takes constant values on luminal 
hyperplanes t = z and the wave vector e_ = 2~ 1 / 2 (d t + d z ) tangent to the hyperplanes: 

e_o0 = O. (1) 

At the same time the wave vector is orthogonal to the hyperplanes because, due to pseudo- 
Euclidean metric any null-vector is orthogonal to itself. The wave propagates along this 
vector, therefore e_ must be identified with the vector of velocity and plays the role of 
velocity of photon in corpuscular model. Now we use these two objects to construct an 
orthonormal frame associated with the wave. 

By construction, there exists an isotropic vector e_ tangent everywhere to the sur- 
faces of constant phase, and, therefore, orthogonal to them. We introduce one more 
isotropic vector e + whose direction is arbitrary, requiring only that its scalar product 
with the vector e_ is equal to one. As it is done we can introduce two unit space-like 
vectors e a which are orthogonal to each other and to e±. The four vectors defined this 
way constitute a local orthonormal frame. 

In Minkowski space-time and Cartesian coordinates considered above the space-like 
vectors e a , a = 1, 2 are defined as follows: 



ei = d x , e 2 = d y . 



They are also tangent to the wave fronts and orthogonal to them. These vectors are used 
for specifying polarization of the wave. The four vectors e a , e± where coordinates are 
chosen such that e + = 2~ 1//2 (<9t — d z ) form an orthonormal frame with metric 

< e+, e + >=< e , e >=< e±,e a >= 0, (2) 
< e_, e + >= 1, < e a , e p >= -S a/3 , a, {3 = 1, 2. 

The frame of 1-forms dual to it, is 

Q- = dt + dz, 9 + = dt- dz 
6 1 = dx, 6 2 = dy 

Now, let us pass to a curved space-time and account main features of an electromagnetic 
wave which can be called locally plane and monochromatic. We start with a wave which 
possesses locally a scalar function called phase. Gradient of this function is an isotropic 
1-form, and its (hyper-)surfaces of level are wave fronts, i.e., it is possible to introduce 
local Cartesian coordinates in which the wave can be represented locally the way just 
discussed. This is possible under some special condition, due to which the wavelength is 
much less than typical scales specified by the space-time curvature. Hereafter we assume 
that this condition is satisfied. The field of orthonormal frames associated with the wave 
can be constructed similarly. 

As this is done it remains to fix one detail. The point is that the metric of this frame 
given by the equations (J2J) remains unchanged if one of isotropic vectors is multiplied by 
an arbitrary factor and another one is divided by it. Employing this operation we can fix 
action of the vector e_ on the phase: 

e + o = a), ijj = const (3) 

while, by construction, the equation (0) remains in force. Now, as the orthonormal vector 
frame associated with the wave {e±, e a } is defined, one can find out the orthonormal 
covector frame {8^, 8 a } dual to it and the connection 1-form u a b = 7 ca b 6 |c for the frame. 
Hereafter we assume that this is done, and pass to considering constraints whose imposed 
on electromagnetic field leaves only locally plane waves. 

4 Electromagnetic field with constraints 

If the field in question draws everywhere a locally plane wave there exists a congruence of 
null-curves {x l (s)} which are integral lines of the vector e_. Since this vector is identified 
with the velocity the wave vector is collinear to it, and potential of the field a has no ± 
components, so, we have a constraint given by 

a = A P 9 P , = 1,2. (4) 

The vectors ei, e.2 are chosen to specify polarization of the wave, hence, their span does 
locally the instant (two-dimensional space-like) wave front. Therefore we can neglect 
changes of the potential in their directions. This constraint can be written as 



eioa = e2o« = 



(5) 



where the operator e^o stands for differentiation along the vector ep. 

By analogy with plane waves in Cartesian coordinates the amplitudes can be repre- 
sented in the form 

A p = a f) e i * (6) 

where the contants dp are complex numbers chosen such a way that the wave has left or 
right circular polarization and the function 0, specifies the phase of the wave. In these 
denotions the constraint (0) coincides with the equation (JTJ. Finally, all the constraints 
imposed above ((H |3]|7J) reduce to the following: the 1-form of field potential has only one 
non-zero derivative 

e + o a = iuj a, e_ o a = ep o a = (7) 
where and u do not depend on the parameter s. 



5 Action principle for constrained fields 

The action functional for electromagnetic field is given by the well-known integral 

A = J da A *da = J < da, da > e, (8) 

where a is 1-form of potential of the field, <, > stands for scalar product and e denotes the 
unit 4-form: e = £yjw/4! 6 l A 6 3 'A 9 A 9 l that corresponds to four-dimensional integration in 
the space-time. Straightforward computation of variation of the action (jHJ yields Maxwell 
equations, valid for all possible shapes of electromagnetic field. Our goal is to restrict the 
functional space of the field with subspace of fields which draw everywhere locally plane 
waves by imposing constraints ((HJ OEj)). 

For this end we expand the quadric form < da, da > in the frame {8} as follows: 

< da, da >= (e a o A b ) (e c o A d ) (e a A 6 b , C A 9 d ) 
= (e a oA b )(e c oA d )(e a A9 b ,6 c A9 d ) 

a, c=± 

= Yl (e a oA b )(e c cA d ){6 a ,6 c ) (9 b ,6 d ) 

a, c=± 

= (ea°A,e c oA> (0° ,9 C ) , 

a, c=± 

where A = A@ ep. 

Though most of terms of the expansion are zero, some of them have non-zero variations. 
Substituting the constrained Lagrangian into the action integral (JEJ) yields: 

A = i J {(e+ o A, e + o A) (6+ , 9+) + (e_ o A , e_ o A) (e~ , 0~) + 

+ ((e_ o A , e+ o A) + C.C. ) (o~ , 6 + ) } e. (9) 

where we take into account the fact that for convenience we use complex valued field 
components. As usual, the components A and their complex conjugates A are regarded 



as independent variables. Due to the constraints ((JTJ [3H7J)) the Lagrangian under integral 
(JDJ) can be transformed as follows: 

{(e+ o A , e+ o A) (6 + , 6 + ) + (e_ o A , e_ o A) (o~ ,6~) + 
+ ((e_ o A , e+ o A) + C.C. ) (9- ,6 + ) } = uj 2 (A , A) (o+ , 6+) + iu (A , A) + C.C. . 

The third term can be ignored because, as will be shown below, action of the vector e_ 
annulates the field, consequently, the expression o A , e_ o A^ is product of two zero 
factors, hence, both the third term itself and its variation are identically zero. Though, 
due to the constraint (JTJ) the term A (and A) is equal to zero, we do not ignore it because 
its variation plays important role in the action principle. Thus, finally, the action integral 
has the form 

A = jCe, C= ~uj{u;(A,A) (x,x) + (z(A,A) +C.C. )}. (10) 



6 Helicity of constrained fields 

The action integral is evidently invariant under rotations of the frame in the plane 
formed by the vectors ei and e 2 . This invariance yields some conservation law due to the 
Noether theorem. To find the law we consider the change of the form of the action integral 
under rotations of the local frames specified by an infinitesimal matrix 57] a b {s) defined as a 
function of the parameter s on each curve. Rotation of the frame changes components of 
the vectors A, A but does not change the vectors. Thus, the first term of the Lagrangian 
(jlOJI containing scalar product < A, A > does not contribute variation. At the same time 
the rotation changes derivative of A: 

5A a = 4^-+ %c a i b A c ) = (^] + A c e a , 



ds " J \ ds 



5uj c a = D b (5 % a )6 b 



where the covariant derivative Db(Sr] c a ) is exactly variation of the connection 1-form. 

The first term in the variation of A does not contribute variation of the action due 
to field equations. By the result, variation of the action is 

SC: I {iulA,^^A a e b )\ + C.C. |e = 



D (A a A h -A h A aS 



2i 



e. 



= J {too (5 Vah y A a A b + C.C. }e = Jd[...] + J 2oo6r] ab ^- ( 
Finally, we have: 

DS ab _ Q 
ds 

So, due to the Noether theorem we obtain conserved spin current with single non-zero 
component 

J ab = J +ab = 2S a b, (H) 



where 

S ab = — (A a A b - A b A a ) 

is the spin tensor of the wave. Due to the constraint (jBJ) it is zero for linearly polarized 
waves, and for circularly polarized waves has single non-zero element 

S = ±uj\a a |, 

whose sign depends only on helicity. The only consequence of this result we need is that 
the spin has single non-zero component, and as for its magnitude, we accept its quantum 
value 1 in dimensionless units. Note that the spin is always pointed along the vector of 
velocity, so no special equation is needed for it. This fact provides implementation of the 
Tulczyjew constraint which requires that conversion of the particle momentum with its 
spin is zero 



7 Field equations 



Now we return to the action functional (fTTH) and compute its variations only under small 
variations of the field which has two components A\ and A2. Variation of the first term 
in the constrained Lagrangian is identacally zero because it contains the factor (0 + , 9 + ) 
which is not varied, therefore we ignore it. Variation of the rest part of the action is 



5A = 5 j {(e+ o A, e_ o A) + C.C. } e 
5 (e+ o A) , e_ o A) + (e+ o A, 5 (e_ o A) ) + C.C. 1 e 



o 5 A, e o A 



e 1 o A, e_ o 5 A 



C.C. 



where action of vectors e± is considered as differentiation along the vectors. The next 
step is to extract total derivatives: 

SA = J e + o <^A,e_ o A J) + e_ o (e + o A, 5 A) + C.C. e + 

'^5 A, e + o (e_ o A) + e_ o (e+ o A)^> e 
- I ( 5 A, e + o (e_ o A) + e_ o (e + o A))e. 



Note that due to the metric of the null-frame (J2J combinations like e + o V_ are parts of 
divegence of a vector V, and if the vector has only one component '— ' this expression 
coincides with its divergence. In particular, the combination e{ + o (e_| o /J is exactly 
Dalembert operator applied to a function / which is constant on the wave fronts. Conse- 
quently, the first term in the right-hand side of the equation above is exactly a divergence, 
hence the integral can be taken by parts and variation of the action integral becomes: 



= fd[...]-J{(6A, e {+ o(e_ } oA)) + C.C.) £ 



where the first term reduces to a surface integral and fugure brackets at subscripts mean 
symmetrization. As usual, the surface integral vanishes at infinity and all the rest reduces 
to the following Euler-Lagrange equation: 

e.r + o (e_| o a) = 0. 

Evidently, this covariant equation reduces to Dalembert equation in local Cartesian co- 
ordinates provided that the curves x(s) are locally null straight lines. In fact, any vector 
whose covariant derivative along the curve is zero: 

e_oA = 0, (12) 

and so for the complex conjugate, satisfies this equation. The constrained fields satisfy 
this equation due to the equations (PEJ), consequently, this part of the entire variation 
of the action integral is zero due to the constraints. Though the field equation leaves u> 
an arbitrary function of the phase we restrict our analysis with monochromatic waves for 
which this value is constant. 



8 Papapetrou equations 

It remains to consider the second part of variation of the action integral, produced by 
variation of local frames under fixed field variables. Since the local frames are defined as 
co-moving frames on the congruence of null-curves, it is possible to introduce variation of 
the congruence and derive variation of the frames from it. Small change of shape of a curve 
x(s) causes small change of the tangent vector without change of its length. Consequently, 
variation of the vector e_ is orthogonal to it and to the complementary null- vector e + , 
hence, belongs to the span of the two polarization vectors e^. Since the vector e + is 
also orthogonal to the span, it suffers no change. Therefore, variations of both e + and 
the corresponding covector 9~ are zero. Thus, only variations of the vector e_ and the 
corresponding 1-form 9 + contribute variation of the action integral (|10p. 

The first term in the Lagrangian (JTUJl contains the factor (6 + ,9 + ) = (e_ ,e_) = 
0, therefore its contribution is predetermined only by variation of the vector e_ = x. 
Consider variation of four-dimensional integral 

y/|A| 2 (x,x), 

with respect to variation of the curves. Variation of the factor (x , x) is well-known from 
the variation principle for geodesies [U]. Here we can use the fact that variation of the 
integrand reduces to the scalar product of the vector of variation of the curve <5x(s) and 
the covariant acceleration 

as 

*/£|A|>(*,^ = ^/|A|»(**),£ 

The second term in the Lagrangian (fTUj) has single zero factor A in the scalar product, 
consequently, non-zero contribution to the variation of the action integral appears only 




when varying this factor. The zero factor to be varied is A: 

Variation of the covariant derivative contains derivative on s and the term containing the 
connection 7 6c a . Variation of the first of them does not contrtibute variation of the action 
integral because neither polarization vectors e a nor components of the field change under 
varying the congruence of curves. The only term suffering some change is connection 
7 bc a . Therefore we can write down contribution of this term as follows: 

^5 J cu (i (A, A) +C.C. ) e =-iy (^7 fec M c + C.C. )e 

where the sign minus appears due to the lower index at the field component. Thus, the 
next task is to find variation of the connection 5 r j bc a . 

It is easier to find variation of the 1-form u a b because the variation is to be taken in 
a fixed point under changing the field of frames which is dragged by the vector 5x. This 
variation is, by definition, Lie derivative of the connection 1-form with respect to this 
vector. So, to find variation of the connection 1-form it suffices to take its Lie derivative 
with respect to the vector <5x. Lie derivative of a 1-form A with respect to a vector v is |1] 

£ ff X = dX(v) + d(X(v)). 

Unlike ordinary 1-form the form of connection has components referred to local frames. 
Since the frames are built on the vector of velocity on the congruence its variation causes 
some infinitesimal rotation of the frames. Denote the corresponding matrix of rotation 
T] a c . This rotation transforms components of the connection 1-form and must be taken into 
account. To do it it is necessary to obtain explicit form of this matrix from the dragging 
vector #x. 

Consider a point in the space-time a curve passing through it and the local frame 
and small variation of congruence of curves given by small vector 5x which drags the 
congruence. This dragging replaces the curve passed through this point with the curve 
dragged by the vector and the local frame built in this point is also to be replaced by 
the frame dragged by this vector from a neighboring point. Since, on one hand both the 
frames are orthonormal variation of the frames is small rotation. On the other hand, since 
this rotation is specified by dragging orthonormal frame from a neighboring point, this 
transformation is given by the connection itself, in other words the matrix of rotation r\ a c 
is exactly the value of the form of connection on the dragging vector: i] a b = u a b (5x.). 

Thus, Lie derivative of the connection 1-form with respect to the vector <5x is 

£ s ^ b = doo a b (6x) + d(u a b (5x)) - Va c u c b + r/ c V. 
Subsituting the matrix of rotation we obtain the desirted Lie derivative: 
£ 6 ^ b = doo a b (5x) - u a c (5x)u c b + u b (5x)u a c + d(oo a b (5x)) = 



(du a b (5x) + u b A ^ a c )(5x) + d(u; a b (5x)) = n a b (5x) + d(u a b (5x)) 



where we have obtaind the curvature 2-form Q a b = R cda b 9 c A 9 d . Substituting now this 
into variation of A gives: 



5 A = R dbc a 6x d x b A c e a + x d d d ( %c a 6x b A c )e a = 
R dbc a 5x d x b A c e a + ( lbc a 5x b A c )'e a . 

Variation of A is similar, so after composing the total variation of the second term in the 
action integral we obtain the two terms. One is total derivative of uj{A a r ) bc a 5x b A c ) on s, 
which vanishes on the endpoints of the curves. Thus, the whole of variation of this part 
of Lagrangian is given by another term which is 

where we introduce spin by its only component S 1 2- The Euler-Lagrange equation for the 
curves x(s) coincides with Papapetrou equation: 

uW^=R db ^S db . (14) 



9 Conclusion 

Geometric-optical description of electromagnetic wave in curved space-time, with ac- 
count of its polarization is obtained straightforwardly from the classical variational prin- 
ciple for electromagnetic field. For this end the entire functional space of electromagnetic 
fields is reduced to its subspace of locally plane monochromatic waves. Therefore, first 
of all, the notion of locally plane monochromatic wave in curved space-time should be 
defined. It turns out that waves of this sort exist provided that their wavelengths are 
small compared with scales under consideration. Assuming this, we have formulated the 
constraints under which the entire functional space of electromagnetic fields reduces to 
its subspace of locally plane monochromatic waves and imposed these constraints. These 
constraints not only reduce field variables but also introduce variables of another kind 
which specify a field of local frames associated to the wave and contain some congruence 
of null-curves x l (s). These curves become the main object in the construction because 
it specifies the field of local frames and the field variables A and A are referred to this 
frame. 

Returning to the action principle for the constrained electromagnetic field we have 
Lagrangian (jlOJ) which contains variables of two kinds, namely, a congruence of curves 
x l (s) and the field itself and have two kinds of Euler-Lagrange equations. Equations of 
first kind reduce to local Dalembert equation for the field components A and A which are 
trivial due to the constraints imposed. Variation of the curves yields all the rest equations 
which contain the main result of this investigation. It turns out that the Euler-Lagrange 
equations they yield are exactly the Papapetrou equations for a classical massless particle 
with helicity 1. This equation determines the shape of the 0-curves which, by construction, 
can be considered as world lines of photons with the same wavelengths and helicities. They 
apparently differ from null-geodesics and, thereby manifest influence of spin-gravitational 



interaction on propagation of electromagnetic waves in gravitational fields. Effect of this 
interaction is proportianal to the wavelength therefore this fact can be observed in 
radioastronomy. 
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